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OPTIMAL DESIGNS FOR MIXED MODELS IN EXPERIMENTS 
BASED ON ORDERED UNITS 

By Dibyen Majumdar 1 and John Stufken 2 
University of Illinois at Chicago and University of Georgia 

We consider experiments for comparing treatments using units 
that are ordered linearly over time or space within blocks. In ad- 
dition to the block effect, we assume that a trend effect influences 
the response. The latter is modeled as a smooth component plus a 
random term that captures departures from the smooth trend. The 
model is flexible enough to cover a variety of situations; for instance, 
most of the effects may be either random or fixed. The information 
matrix for a design will be a function of several variance parameters. 
While data will shed light on the values of these parameters, at the 
design stage, they are unlikely to be known, so we suggest a maximin 
approach, in which a minimal information matrix is maximized. We 
derive maximin universally optimal designs and study their robust- 
ness. These designs are based on semibalanced arrays. Special cases 
correspond to results available in the literature. 

1. Introduction. When planning an experiment to compare different treat- 
ments, it is important that we carefully consider the possible presence of 
systemic natural differences between the experimental units to be used. If 
such differences are thought to exist, blocking and the use of covariates are 
two methods that may help to increase the sensitivity of the experiment for 
detecting possible differences between the treatments. These two methods 
are at the core of this paper. 

Blocking always leads to a restricted randomization, in which, for each 
block, a selected set of treatments is randomly assigned to the experimental 
units in that block. The use of covariates only leads to a restriction on the 
randomization if the covariates are already used at the design stage rather 
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than just at the analysis stage. If covariates are used at the design stage, 
the designs are often referred to as systematic designs, even though there is 
usually still some opportunity for a restricted randomization. 

Cox [5] considered an experiment involving the processing of wool, using a 
different treatment each week. The natural aging of the wool (which formed 
the experimental units) caused a trend in the units over time and made time 
a convenient and useful covariate to account for systemic changes in the 
experimental units. Cox showed that a systematic assignment of treatments 
to the units that allowed for estimating the treatment differences in the same 
way as without the trend was preferable to a fully randomized assignment 
of the treatments to the units, or to attempting to reduce the effect of the 
trend by blocking the units. 

The covariates that we will consider in this paper are precisely of this type, 
that is, they are based on a natural ordering of the experimental units, typi- 
cally induced by time or spatial location. However, our discussion is entirely 
in the context of block designs, in which each block has the same number k 
of experimental units. In each block, the units are labeled from 1 through 
k, which induces the covariate (or possibly covariates). The designs in this 
paper are relevant if it is thought that units may change gradually across 
this ordering in each of the blocks, although this change may differ from 
one block to the next. This change could, for example, occur as the result of 
a learning process, equipment or product deterioration, or spatial location. 
For some examples and further discussion and references, we refer the reader 
to Bradley and Yeh [1], Chai and Majumdar [3], Lin and Dean [11], Lin and 
Stufken [12], Jacroux, Majumdar and Shah [7, 8] and Majumdar and Martin 
[15]. Lin and Stufken [12] also contains some additional discussion on the 
pros and cons of using systematic designs. 

Thus, we consider the situation where experimental units are partitioned 
into equally large groups of relatively homogeneous units, or blocks, and 
where, within each block, the units are linearly ordered over time or space. 
We will build a model that is more flexible than models thus far considered 
for this situation, and that contains other models as special cases. The model 
will include random block effects (which contains the model with fixed block 
effects as a special case), random trend effects that may differ from one block 
to the next (which contains fixed trend effects, whether the same for each 
block or varying over the blocks, as a special case) and unit-specific random 
deviations from the trend (motivated by our belief that a smooth trend is 
often not very realistic). The latter is a feature hitherto not used in this 
arena. 

While this mixed-effects model will be very flexible, it will also typically 
contain a considerable number of unknown covariance parameters. Data may 
help to shed some light on these parameters at the analysis stage, but this 
is of little help at the design stage. The determination of an optimal design 
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for estimation of the treatment differences, which is the objective of this 
paper, would therefore seem to be a rather intractable problem since the 
information matrix for the treatment effects will depend on the many un- 
known covariance parameters. We will address this problem by identifying, 
for each design, a "minimal" information matrix for the treatment effects. 
This minimal information matrix, which will be smaller in the Lowner or- 
dering than the actual information matrix for the treatment effects, will 
depend on very few (no more than two) parameters, which are functions 
of the original covariance parameters. It is this minimal information matrix 
that we will maximize over all designs to obtain a "maximin" information 
matrix and an optimal design. We note that our approach is in the spirit 
of Kiefer [9] and Kiefer and Wynn [10], who considered minimax optimal 
designs for models with autoregressive errors. 

The maximin information matrix will be derived in Section 2. After char- 
acterizing and identifying the optimal designs in Section 3 (with proofs de- 
ferred to Section 5), we will investigate the robustness of this process to 
parameter misspecification in Section 4. 

2. Setup and basic results. Consider an experiment to compare v treat- 
ments (i = 1, . . . , v) based on n = bk experimental units that are partitioned 
into b blocks (j = 1, . . . , b) of k units each (p = 1, . . . , k). Suppose the units 
within blocks are linearly ordered over time or space. The collection of units 
can be visualized as a k x b array with rows labeled by units within blocks 
and columns by blocks, while the entries of the array are treatments as- 
signed to the units by the design, that is, for design d, the entry in cell (p, j) 
is d(p,j), where d(p,j) £ {1, . . . ,v}. The design d itself will be viewed as a 
k x b matrix. Under a very general model, our objective is to determine an 
optimal design for comparing the treatments. 

For the model, in addition to a block effect, we assume that there is a trend 
over time or space within each block. If y p j denotes the random variable 
corresponding to the observation in row (unit) p and column (block) j, then 
the model is 

Dpi = A* + r c((p,j) + ftj + Cpj + £pj ? 

where t^ p j) denotes an effect for treatment d(p,j), (3j an effect for block j, 
Q p j a trend effect for unit p in block j and e P j the measurement error. We as- 
sume that the trend Cpj is composed of two parts, one that is smooth enough 
to be approximated by a polynomial in p and another that represents random 
fluctuations from the polynomial. The smooth part, which we assume to be 
linear, will be written as + 7j0(p), where 4>{p) is the linear orthonormal 
polynomial on {1, . . . , k}, specifically, (f>(p) = y / 3/(k(k 2 — l))(2p — k — 1). 
If ip p j denotes the fluctuation from the smooth trend then we may write 
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( p j = + ^j4>{p) +' l Ppj- The slope jj is further decomposed into a fixed part 
(9q) that is common to all blocks and a random part (9j) that may vary from 
block to block, that is, jj = 9q + 9j . Writing (3j = Pj +lj and 5 p j = ip p j + e p j , 
we arrive at our model, 

(2.1) y pj = fi + r dipJ) + (3j + 9 ct>{p) + 9j(f)(p) + 5 pj . 

This is a mixed-effects model. The quantities t±,...,t v and 9q are consid- 
ered to be fixed effects while 0j, 9j, 5 p j for p = 1, . . . , k, j = 1, . . . ,b are 
considered random. Although (2.1) combines the two variables ij) p j and s p j 
into one variable 5 p j, at the modeling stage, it may be useful to recognize 
their individual characteristics. Generally, the measurement error e p j may 
be assumed to be homoscedastic, while tp p j, the departure from the assumed 
trend, will depend on the strength, nature and variability of the trend in the 
particular application. These considerations may enable the experimenter to 
determine an appropriate variance-covariance structure for the 5 p j's. 

We assume that the random effects have zero expectations and are un- 
corrected from one block to the next. Let <r^ and Uq denote the variances 
of (3j and Oj, apo their covariance, Vsp and Vse the k x 1 vectors of co- 
variances of 5j = (5\j, . . . ,Skj)' with (5j and 9j and V$s the covariance ma- 
trix of the 5's. Let 1^ denote the k x 1 vector of l's, r = (ri, . . . ,T V )' and 
<j) = ((f)(1) 4>(k))' . If Yj denotes the k x 1 vector of observations from 
block j, then E(Yj) = X dj r + Z 7, where 7 = (/x, 9 )' , Z = (l k , 4>) and X dj 
is the k x v unit-treatment incidence matrix for block j with entries 



Xdj(p,i) 



1, tfd(p,j)=i, 
0, otherwise. 



Also, V(Yj) = S = a}l k l' k + al(l ) <P' + Vs 5 + ape(lk(p' + ^) + (nV^ + V5 (3 l' k ) + 
(4>Vl e + Vsefl). Let Z = (l bk , l b ® <f>) and X d = (X' dl , X'J' , where de- 
notes Kronecker product. The first and second moments of the observations 
Y = (Y{,...,Y b y are then 

(2.2) E(Y)=X d T + Z 1 ,V(Y) = V = l b ®Y, ) 

where I b is the identity matrix of order b. The parameter of interest is r, 
the vector of treatment effects. For a design d, the information matrix for r 
is given by 

c d = x' d v~ l x d - x' d v~ 1 z(z'v~ 1 zy 1 z'v~ 1 x d . 



Remark 2.1 [Special cases of (2.2)]. If Vgg = aoI k with ao > (equiva- 
lently, Vs$ = a$l k + a\l k l' k + a2(p4>' with ao > 0), then the model is equivalent 
to one in which the random trend in each block is known to be linear. If, in 
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addition, <r| = 0, then the model is equivalent to one in which the trend is 
fixed, linear and the same for each block. This has been studied by several 
authors, including Bradley and Yeh [1], Yeh and Bradley [21], Yeh, Bradley 
and Notz [22], Stufken [20], Lin and Dean [11], Chai and Majumdar [3], 
Chai [2] and Lin and Stufken [12, 13]. Alternatively, if = oo, then the 
model is still equivalent to one in which the trend is fixed and linear, but 
now possibly different in different blocks. This has been studied by Jacroux, 
Majumdar and Shah [7, 8] and Majumdar and Martin [14]. For any <t|, if 
cr| = oo, then the model corresponds to one in which the block effects are 
fixed. 

Our objective is to identify a universally optimal design for estimating 
r. For (2.2), Cd depends on the 4 + 2k + k(k + l)/2 variance component 
parameters in E. If these are all known at the planning stage, then the opti- 
mization problem can be solved by numerical techniques. However, this will 
rarely be the case. Our approach, therefore, is to work with few parameters 
at the design stage. To do this, we first identify, for each design d, a minimal 
information matrix C d , and then determine a universally optimal design 
based on the minimal information matrix. This is, therefore, a maximin ap- 
proach. We will derive a minimal information matrix that depends on at 
most two parameters (other than v, b and k), which are functions of the 
original variance components. Once the data has been collected, however, 
we recommend a less parsimonious approach. At the inference stage, the 
experimenter should work with a realistic model with all likely parameters 
included and let the data decide. 

We will use the Lowner ordering to identify the minimal information 
matrix, that is, B y A if B — A is nonnegative definite. Formally, the first 
step is to identify, for each d, a matrix C d such that Cd y C d , where C d is 
an information matrix for the design d corresponding to a simplified model. 
The next step is to find the optimal design d* such that 

C d * is completely symmetric and trace(Cji) = max(C^). 

To get C d , we utilize the representation Cd = X' d Q(QVQ)~QXd, where 
Q = l n — Z(Z' Z)~ l Z' , and observe that a lower bound for C d may be ob- 
tained by using an upper bound for £. To get an upper bound for E, we note 
that in most situations, variances are easier to determine than covariances. It 
can be shown that E = V{5j + (3jl k + 0j<j>) =< W{5 j ) + AV(f3 j l k ) + W{9 j (j)) ■< 
4# max (V a(5 )I fc + 4CT^l fc l / fe + 4of#', where E max (V S s) is the maximum eigen- 
value of V$$. This bound is generally conservative, and we can do better if 
there is additional information. For example, if Vsp = Vgg = 0, that is, the 
fluctuations from the trend Sj are not correlated with the modeled part of the 
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trend 6j and the block effect /3j, then £ < E max (Vss)Ik + + 2cr^( 

Hence, in general, we have 

E d 00% + 000 Mfc + 0O0#'i 
using quantities 0"oe'0ofl and a™ that take values in the intervals 



E max a (Yss) < o\ e < 4£ max a (Vss), (rL < crL < 



00 a < ele < 4a^ a , 

where <x| a , <7g a and i?maxa(^5<5) are the assumed or prior values of erg, er| 
and ^max^ii)) respectively. If the correlations are believed to be negligible 
the values of <7q £ ,(Tq^ and a^g should be taken at the lower endpoints of the 
intervals or close to it, while for stronger correlations, these values should 
be assumed higher. We will see in Section 4 that the optimal designs are 
remarkably robust, so an accurate determination of <7oe>0o/3 an d a oe within 
their respective intervals is usually not necessary. 

Our first theorem gives the minimal information matrix. To state it, we 
use the standard notation for a design d: r d i will denote the replication 
of treatment i, r d = (r dl , . . .,r dv )', R d = diag(r d i, . . . ,r dv ), N d = (n dij ) the 
treatment x block (column) incidence matrix and M d = (m d i p ) the treat- 
ment x unit (row) incidence matrix. Also, let 

(2.3) Aq = — o — — : — n~, A 



0o £ + fc 0o/3 ' ' m± a 0e + °~oe 



Theorem 2.2. The information matrix C d for a design d based on the 
model (2.2) satisfies C d y , where 

(2.4) al £ C L d = j:X> dj WX dj - - M d ^M' d 
with 

(2.5) W = I fc -Aolfclfc-A 1 #'. 
The proof is straightforward and hence omitted. 

Remark 2.3. (i) C% is the information matrix for model (2.2) with 
uncorrelated random effects (Vsp = V$e = 0,0-/30 = 0), V(5j) = V$s = o-Q £ I n , 
V (Pj) =°\ = 0o/3 and V(9j) =aj = oIq. 
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(ii) The minimal information matrix may also be written as 



b 



al e C L A =R d - X N d N' d - Ai ]T X' dj ^'X dj 



i=i 



3. Optimal designs. In this section, we will explore optimal designs for 
model (2.1). Our goal is to determine universally optimal designs using the 
minimal information matrix (2.4), that is, a maximin universally optimal 
design. First, we need a definition. 

Definition 3.1 (Rao [18, 19]). A t xb array in v symbols is called an 
orthogonal array of type II (OAII) of strength 2 or a semibalanced array if 
the columns of the array consist of distinct symbols and any two rows of the 
array contain every pair of distinct symbols equally often. 

For the construction of these arrays, see Hedayat, Sloane and Stufken [6]. 
We will establish the maximin universal optimality of judiciously selected 
designs of the form 



where d\ is a semibalanced array with rows that are uniform in symbols, di 
is a matrix with each row identical to some row of d\ and II is a permutation 
matrix of order k. Note that the rows of a semibalanced array are always 
uniform when there are three or more rows. 

Universal optimality will be established by the technique outlined in The- 
orem 1 of Majumdar and Martin [14]. To establish complete symmetry (as.), 
note that each treatment occurs equally often in each row of d. Hence, for 
a design of the form (3.1), M~d> = and r~r~ is c.s. Also, it follows from 



Cheng [4] that £?=i X'~WX~ is c.s. Hence, C| in (2.4) is c.s. 

For an arbitrary design <i £ D, the trace of the maximin information ma- 
trix is given by 



(3.1) 
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Note that <ft' 'M' d M^ > 0, r' d r d > - — and both of these lower bounds are 
attained by designs of the form (3.1). Since 

(3.2) 0<A <-, 0<Ai<l, 

if there is a design of the form (3.1) that maximizes trace(5^ 3 -_ 1 -X'^WX < ^) 
among all designs, then this is maximin universally optimal. 

Writing, W = (w pq ) we get trace (X' dj WX dj ) = Y, p =\Wpp + 2J2( p , q )W pq , 
where the second summation is over all pairs of experimental units p, q, 
p <q, that are occupied by the same treatment in block j. Since J2 p =i w pp 
does not depend on the design, an order of assignment of treatments to block 
j that maximizes the second sum will maximize trace (X'^WX^j). Note that 
since w pq does not depend on the block subscript j, the pattern that is op- 
timal for block j is also optimal for any other block and, combined, will 
maximize trace(X]j=i X'^WX^j). 

Let O ={ir = (7r(l), . . . , ir(k)) : ir(p) € {1, . . . , v}, p=l,...,k} be the set 
of orders ir. For each order ir 6 O, there is a design of the form (3.1) in 
which 7r is the first block as long as a k* x b semibalanced array based on v 
treatments exists, where k* is the number of distinct treatments in ir. For 
vr G O, let 

P(tt) = {(p, q) : 1 < p < q < k, n(p) = 7r(q)}, 
F ( 7T ) = J2 W PQ- 

p,qeP(7r) 

Note that F(ir) depends on v,k,Ao and Ai, but not on b. 

The trace maximization problem may be stated as, given v, k, Xq and Ai 
satisfying (3.2), maximize F(tt) over all ir € O. An order ir* that maximizes 
F(ir) will be called an optimal order. In the next two subsections, we will 
identify optimal orders ir* . We will distinguish between the two cases k <2v 
and k>2v. The proofs are given in Section 5. 

Before concluding this section, we give an alternate expression for F(n). 
Since for p^ q, w pq = — Ao — \i<j){p)<j){q) , we have 

(3.3) F(tt) = -\ s(tt) - AiT(tt), 

where s(ir) = \P(n)\, the cardinality of P(ir), and T(ir) = X^gePM ^(p)^^)- 
For i = 1, . . . , v , if we denote 

(3.4) rii = rii(ir) = replication of treatment i in ir, 

k 

(3.5) hi = hi(ir) = ^ 6 ip (f>(p), 

p =i 
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where 5i p = 5i p (vr) = 1 if n(p) = i and equals zero otherwise, then it follows 
from (3.4) that 



(3.6) 

Also, since 



s = s 



n;(nj - 1) 1 



i=l 



E 

i=l 



n~ 



2T(vr) = £ %)^(?)=E EM(P) -E^(p)' 
p,g€P(7r) i=l \p=l / p=l 



using (3.5), we get 
(3.7) 



TV) 



E^-i 



i=l 



3.1. Optimal orders when k < 2v. For a positive integer q > k — v, we 
define ir q to be an order of the form 

(3.8) 7Tq = {h,i 2 , ■ ■ .,i q ,i q+ i,.. .,i k -g,i q , . . . ,i2,h}, 

where i\, 12, ■ ■ ■ , ik-q are k — q distinct treatments. We will also write itq for 
an order of k distinct treatments. We define 

k + 1 



(3.9) 



Max< p : p integer, 1 < p < 



,\i4> 2 {p) > A 



0. 



if A!0 2 (1)>A O , 
if Xi<P 2 (l)<X . 



Note that s* is a function of k, Ao and Ai, but not of v or b, and s* < k/2. 

Lemma 3.2. Suppose k<2v. 

(i) If k <v + s* , then tt s * is an optimal order. 

(ii) If k > v + s* , let a = k — v. n a is then an optimal order. 

Using (3.6) and (3.7), we note that ir a minimizes s(tt) over tt € O and 
minimizes T(tt) among all orders that minimize s(ir). A proof of Lemma 3.2 
is given in Section 5. 

3.2. Optimal orders when k > 2v. For given k and v, k > 2v, we define 
integers m and t by 



(3.10) 



k = mv + t where < t < v and m>2. 



Depending on the values of m and t, an optimal order will turn out to be 
either a trend-free (TF) or nearly trend-free (NTF) order. An order is called 
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trend-free if all treatments are "orthogonal" to the fixed part of the trend, 
that is, for each i = 1, . . . , v , 



It is easy to see that a trend- free order can only exist if, for each i = 1, . . . , v, 



When k is odd, any integer rii satisfies (3.12). A TF order can be constructed 
in this case if rii > 2 for all i. The treatments with even replication are 
used at the beginning and at the end of such an order it in such a way 
that 7r(p) = 7r(k — p + 1) for these positions. For treatments with odd rii, 
n i > 3, (3.11) can be achieved by filling the remaining positions using the 
construction of Phillips [17], which is also reproduced in Lemma 3.2(a) of 
Jacroux, Majumdar and Shah [8]. It follows from (3.7) that for a trend-free 
order, T(tt) = —1/2, the lower bound. 

When k is even, (3.12) implies that rii must be even for all i, in which case 
an order n with the property vr(p) =ir(k — p + 1) for all p satisfies (3.11). If 
rii is odd for some i, then it can be shown (see, e.g., Lemma 3.1 of Jacroux, 
Majumdar and Shah [8]) that 



Provided that > 3 for treatments with an odd replication, using the con- 
struction of Mukerjee and Sengupta [16], which is also reproduced in Lemma 
3.2(b) of Jacroux, Majumdar and Shah [8], the lower bound in (3.13) can be 
achieved for each such treatment, while at the same time achieving hi = for 
treatments with even replications. For k even, orders that satisfy (3.11) and 
the lower bound in (3.13) for treatments with even rii arid odd rii, respec- 
tively, are called nearly trend- free orders. Note that for a nearly trend- free 
order tt in which u treatments have odd replications, T(ir) = ^[u(4>(^)) 2 — 1]. 

The trend-free and nearly trend-free orders that are especially relevant to 
our investigation are given in the following definitions. 

Definition 3.3 (Trend- free orders). 

(i) When k is odd, we use h^f to denote any order with the following 
two properties. 

(a) The replications are n± = ■ ■ ■ = n t = m + I, n t+ i = • • ■ = n„ = m, there 
being no treatment with replication m + 1 if t = 0. Treatments with even 
replication occupy the "outer" positions [which are 1, . . . , (m + l)t/2 and 
k — (m + l)i/2 + 1, ...,k when m is odd and 1, . . . ,m(v — t)/2 and k — 
m(v — t)/2 + l,...,k when m is even]. The remaining, "inner," positions 
are occupied by treatments with odd replication. 



(3.11) 



hi = 0. 



(3.12) 



ni(k + l) = (mod 2). 



(3.13) 
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(b) Treatments with even replication are arranged so that 7r FF (?) = ^tf — 
p+1). Treatments with odd replication are arranged using the construc- 
tion of Phillips [17] so that hi = 0. 

(ii) When k is even and k /v is an even integer (so that m is even and t = 
0), we use ir^, F to denote any order with raj = m, i = 1, . . . ,v, and 7r^ F (p) = 
7T^ F (k -p + 1) for p = 1, . . . , k/2. 

(hi) When k is even and k/v is not an even integer, we use tt ff to denote 
any order with m € {£, £ + 2} for alH = 1, . . . , v, where £ is the even integer 
in {m — l,m} and vr^ F (p) = Tr^ F (k — p + 1) for p = 1, . . . , k/2. 

Definition 3.4 (Nearly trend- free orders). When k is even and k/v is 
not an even integer, we use 7Tjvtf to denote any order with the following two 
properties. 

(a) The replications are n\ = ■ ■ ■ = nt = m + 1, iit+± = ■ ■ ■ = n v = m, there 
being no treatment with replication m + 1 if t = 0. Treatments with even 
replication occupy the "outer" positions [which are 1, . . . , (m + l)i/2 and 
k — (m + l)t/2 + 1, . .. ,k when m is odd and 1, . . . ,m(v — t)/2 and k — 
m{v — 1)/2 + 1, . . . , k when m is even]. The remaining, "inner," positions are 
occupied by treatments with odd replication. 

(b) Treatments with even replication are arranged so that itntf{p) = 
ttjvtf(^ — P + !)• Treatments with odd replication are arranged using the 
construction of Mukerjee and Sengupta [16] so that \hi\ = = 

Optimal orders are given in the following lemma, the proof of which is 
again postponed to Section 5. 

Lemma 3.5. Suppose k>2v. 

(i) If k is odd, then tt ff is an optimal order. 

(ii) If k is even and k/v is an even integer, then n^p is an optimal order. 
(hi) If k is even and k/v is not an even integer, then 

iTj>f is optimal if Xi(j) 2 ^—^j > Xq; 
ttmtf is optimal i/Ai0 2 ^— ^ < Ao- 

3.3. Optimal designs. The main result is formulated in Theorem 3.6 and 
is an immediate consequence of the considerations in the previous subsec- 
tions. 

Theorem 3.6. Given v, k and Ao, Ai satisfying (3.2), suppose tt* is an 
optimal order given by Lemma 3.2 or 3.5. Suppose b is such that a k* x b 
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semibalanced array based on v treatments exists, where k* is the number of 
distinct treatments in tt* . Let 

^ n (t) 

be a k x b array, where d* is a k* x b semibalanced array, each row of d% is 
identical to some row of d\ and U is a permutation matrix such that, after 
relabeling treatments if necessary, d*(p, 1) = 7r*(p), p = 1, . . . , k. d* is then a 
maximin universally optimal design. 

Remark 3.7. For the case ape = 0, Vsp = Vgg = 0, V$§ = 00 £ I n , a\^ = 
oo (Ao = 1/k) and a^ d = oo (X% = 1), our results reduce to the results of 
Jacroux, Majumdar and Shah [8]. Therefore, Theorem 3.6 may be viewed as 
a generalization of their Corollary 4.3 and Theorem 4.6. In particular, our 
results extend the results of Jacroux, Majumdar and Shah [13] to models 
with random block and trend effects. Moreover, our proofs are different from 
theirs and arguably less cumbersome. 

Remark 3.8. For the case ape = 0, Vsp = Vso = 0, Vss = c>o £ I n , a^ = 
oo (Ao = 1/&) and a^ e = (Ai = 0), Theorem 3.7 of [3], established the 
existence and optimality of "strongly balanced" BIB designs. Our approach 
can be used to generalize this result to models with an arbitrary a^ > 
(Ao £ (0,1/ k]), that is, models with random block effects. 

4. Robustness. For given v, b and k, the existence and construction of 
the optimal designs in Section 3 may require knowledge of the covariance 
parameters Ao and Ai. An important issue is whether the misspecification 
of these parameters can lead to the choice of inefficient designs. We restrict 
ourselves to the case Ai > 0. As in Section 3, we will distinguish between 
the cases k <2v and k>2v, starting with the slightly simpler latter case. 

For k > 2v, if k is odd, or k is even and k/v is an even integer, then the 
optimal design given by Theorem 3.6 does not depend on Ao or Ai. Hence, 
provided b is such that it accommodates the optimal design in the theorem, 
for these cases, there is no need to specify Ao or Ai to select an optimal 
design. On the other hand, if k is even and k/v is not an even integer, then 
the order for the optimal design in Theorem 3.6 is 

■Ktf if Y < 4? f and ir NT F if ^ > 4> 2 {^j ■ 

Thus, misspecification of Ao or Ai could lead to the selection of Tt^pp in cases 
where the design based on kntf is optimal, or vice versa. How bad can this 
be? 
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Since k is normally rather large for this case, 2 (§) = (2( J 1 )) -1 will tend 
to be small. Therefore, unless Ao is near zero, which corresponds to the case 
of no block effects, the optimal design will be based on ttntf- Moreover, that 
design turns out to be very efficient when it is not optimal. To see this, a 
natural measure of the relative efficiency of the design based on ttntf when 
the design based on n^p is optimal is the ratio 



(4.1) 



Eh 



trace (Cf (ir NTF )) 
trace(C£(7r£ F )) 



where C^(tt) stands for the matrix for a design as in equation (3.1) that 
is based on the order n. When the design based on ttntf is optimal, we can 
use 1/E\ to measure the relative efficiency of the design based on tt^ f . 

Expressions for the two traces in (4.1) can easily be computed from the 
results in Section 3. This leads to 



a\ e trace {C^(-k NT f)) 



b[k-kX -2X s r 



k 



(1 



b k-kX 



o 



2X s mi 
■t)A^ 2 



v 



kX )j - bt\i<ff 
if m is even, 
kX ) 

if m is odd, 



<trace(C^(7r^)) 



b k-kX 



o 



b k-kX 



o 



(1 
(1 



kX )j - btX , 
if m is even, 
kX )\ -b{v-t)X , 
if m is odd, 



where m and t are defined in (3.10) and 
(4.2) s min = -[{v- t)m 2 + t{m + l) 2 



mv 



v + t]. 



Note that E\ does not depend on the value of b. It is immediately clear that 
E\ is virtually equal to 1 if Ao = 0, so the design based on kntf is often 
optimal and always highly efficient. It is also seen from these expressions 
that the design based on n^p is highly efficient when it is not optimal. 

Turning to the case k < 2v, based on Lemma 3.2, we arrive at an optimal 
sequence of the form Tr q in (3.8), where max{0, k — v} < q < [k/2\. But, if 
Ao or Ai are misspecified, then we could end up selecting a design with the 
wrong value of q. To see how bad this could be, first note that for all values 
of Ao/Ai within each of the following intervals, there is one order that is 
optimal: (0,(p 2 ([k/2\], [cj) 2 (q + 1), <p 2 (q)] for q = max{0, k — v},..., [k/2\ — 1, 
and [(/> 2 (max{0, k — v} + 1), oo). Thus, if the misspecified value and the true 
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Table 1 
Design efficiencies 



A 





1/40 


5/40 


10/40 


10/40 


10/40 


Ai 


1 


1 


1 


1 


1/2 


1/10 


TO 


71 


73 


77 


83 


100 


100 


7Tl 


97 


98 


100 


100 


98 


86 


7T2 


100 


100 


95 


83 


80 


69 



value of Ao/Ai are in the same interval, then the chosen order is optimal. 
Next, observe that 

/ u \ i 

ag £ trace(Cf (tt,)) = b[k - k\ - X 1 - ~(1 - fcA )J + 26^(A 1( /> 2 (p) - A ). 

p=i 

The efficiency of 7r g may be defined as 

E _ trace(Cf (x q )) 
2 trace(C£(7r 9 *))' 

where q* is either s* or a, depending on which order is optimal, according to 
Lemma 3.2. Note that again, E% does not depend on b. It can be shown that 
the efficiency gets smaller as we move away from the optimal order ir q * . We 
will limit our consideration of the magnitude of the efficiencies to a small 
example. 

Let k = 4 and v = 7. Depending on the value of Ao/Ai, an optimal order is 
either ttq = {1,2, 3, 4}, tt\ = {1,2, 3, 1} or tt^ = {1,2, 2, 1}. More precisely, ttq 
is optimal if A /Ai > 4> 2 {l) = 9/20, m is optimal if 1/20 = 4> 2 {2) < A /Ai < 
2 (1) = 9/20 and vr 2 is optimal if A /Ai < 2 (2) = 1/20. Table 1 shows the 
efficiencies (rounded to nearest percentages) of these designs for selected 
values of Ao and Ai- 

The conclusion is that we need to be a bit more careful in this case, but 
that a design that is less extreme (in terms of the value of q) is more likely 
to keep a high efficiency, except possibly for extreme values of Xo/Xi- 

5. Proofs. 

Proof of Lemma 3.2. (i) Suppose s* > 0. For tt <E O and j = 0, 1, . . . ,k, 
let us define 

(5.1) Sj = Sj (7r) = number of symbols that appear j times in it. 

It follows that v = sq + si + h s^, k = s\ + 2s2 H + ks^ and s = s(ir) = 

s 2 + (2) s 3 + ' — I" (2)^- Suppose positions pi, P2 are occupied by symbol i±, 



OPTIMAL DESIGNS FOR MIXED MODELS 



15 



positions p$, P4, p§ are occupied by %i and so on. We can then write 
F(ir) = -A s - Ai[0(pi)0(p 2 ) + 4>(P3)4>{Pi) 

+ 4>(P3)<P(P5) + 4>(p±)4>{P5) H — ] 



(5.2) 



-Aos - y [(<Kpi) + HP2)) 2 ~ ^(Pi) ~ ^ 2 (P2) 



+ (<P(P3) + <P(Pi) + <P(P5)) 
-<j> 2 {pz)-<t?{PA 



r(p 5 ) + 



< -A s + 



Li=l 



where 
(5.3) 

Note that 
(5.4) 



h = h(ir) = 2s2 + 3^3 + • • • + ksk = k — s±. 



s-h/2 = Y^ 

1=2 



1(1-1) 



si>0. 



Hence, we get from (5.2) 
(5.5) F (tt)<-\ ^ + ± 



i=l 



1 



-EfAi^-Ao]. 



i=l 



It follows from (3.9) that an upper bound for F(tt) is given by 

s* 

(5.6) F(tt)<^[A 1( A 2 (p)-Ao], 

p=i 

which is attained by the order tt s * defined in (3.8). 

When s* = 0, F(tt s *) = and for any other order tt, it follows from (5.5) 
and (3.9) that F(jr) < 0. This establishes part (i) of Lemma 3.2. 

(ii) If equality is attained in (5.6) by an order tt, then it is clear from (5.4) 
that h(Tf) = 2s*, Sj(ir) = for j > 3, 52(7?) = s* and si(tt) = k — 2s*. However, 
for k > v + s*, the number of treatments required by tt is k — 2s* + s* = 
k — s* > v so that tt does not exist. To prove (ii), let us start by evaluating 
the possible range of s(ir) for an optimal order tt. Since 1 < k/v <2, it follows 
from (3.6) that s(ir) is minimized when rtj € {1,2}; It can hence be shown 
that Mm n £o s(tt) = a. On the other hand, if s(tt) > k/2, then the trend-free 
order ir e defined by 

e _ ( (l,2,...,fe/2,fc/2,...,2,l), if k is even, 

~ I (1, 2, . . . , (k - l)/2, (k + l)/2, (k - l)/2, . . . , 2, 1), if k is odd, 
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satisfies s(vr) > s( vr e ) = [k/2\ and T(vr) > T(vr e ) = -1/2 = Min^o T(tt), 
hence F(ir) < F(ir e ). Therefore, without loss of generality, we may assume 
a<s(vr) <k/2. 

Suppose W € O is arbitrary with s(w) = a + g, where < g < k/2 — a. It 
follows from (5.3) and (5.4) that s±(w) > k — 2(a + g). Therefore, from (3.7), 
we get 



T(W) 



i=l 



> 



E 



i 



i:n;=l 



>-E^ 2 W- 



Using (3.3), we obtain F(7r) < Max^^^+g F(tt) = 2~2p=i [^i4> 2 {p) ~ Ao]- Since 
(fc+l)/2 > a + 5 > a > s*, (3.9) implies £p=! [Ai</> 2 (p) - A ] < Ep=i[Ai0 2 (p) - 
Ao]. It can be shown that 2~)p=i[Ai0 2 (p) — Ao] = F(ir a ), with 7TQ, as defined 
in (3.8). Hence, F(w) < F(ir a ). This completes the proof of Lemma 3.2. □ 

To prove Lemma 3.5, we need a result that is stated and proved below. 

Lemma 5.1. Let k>2v and k be even. With m and t as defined in 
(3.10) and with u denoting an integer, < u < v , let O u denote the set of 
all orders with precisely u treatments that have an odd replication. Then, an 
order ir G O u with the following properties minimizes X^=i n i over O u : 



(i) when m is even, 
u + t 



2 

u — t 



Sm—1 



S m +1 = U, 



v — u, 



s m+2 
s m+l 



U 



2 

u + t 



(ii) when m is odd, 
v — u — t 



Sm—1 



U. 



Sm+1 



V — U + t 



U + V — t 



Sm+1 = V - U, 



Sm+2 



U — V + t 



Here, the sfs are the quantities defined in (5.1). 



if u<t, 
if u> t; 

if u <v — t, 
if u > v — t. 



Proof. We will first show that an order that minimizes 2~2i=i n 1 m ®u 
(a "minimizing order") satisfies the following: 



(5.7) 



If Sj > and sj 1 > then \ji —jo\ < 2. 



To see this, suppose it is an order such that sj > 0, Sj 1 > for j\ > jo + 
3. Suppose nj = jo and = ji- Let n' be an order obtained from ir by 
only changing two appearances of treatment i\ to treatment iq. For ir' , 



OPTIMAL DESIGNS FOR MIXED MODELS 



17 



= j\ — 2, n' iQ = jo + 2, n! i = for all i ^ io,h, hence %' 6 O u . Clearly, 
ELi " ELi n? = (jo + 2) 2 - jo 2 + (ii - 2) 2 - if = 4(io - ji) + 8 < -12 + 
8 < 0. Hence, tt' is "better" than 7r. For tt', s'j Q = Sj — 1, = — 1, 

4j+2 = Sjo+2 + 1, 4i-2 = s ii-2 + 1 and s j = s j for i ^ O'cio + 2, ji - 2,ji}. 
Repeated application shows that (5.7) must hold for a minimizing order. 

Since k = mv + 1, < t < v, for a minimizing order, we have two possibil- 
ities: 

(5.8) Sj = for j tf: {m, m + 1, m + 2} or 

(5.9) Sj = for j (fi {m — 1, m, m + 1}. 

Suppose m is even, m > 2. Clearly, u and i are even. For tt S O u , if (5.8) 
holds, then s m +i = u. It follows from 

(5.10) s m + s m+ i + s m+2 = v , ms m + (m+l)s m+ i + (m + 2)s m+2 = /c 

that an order tt £ O u with s m+ i = u must satisfy 

u + t £-u 

(5.11) S m = D — , S„,. + i=U, S m+2 = —^— ■ 

On the other hand, if (5.9) holds, then s m _i + s m ,+i = ii. Identities (5.10) 
imply that an order tt £ O u with s m _i + s m +i = u must satisfy 

u — t u + t 

(5.12) s m _i = — — , s m = v-u, Sm+ i = —^- . 

It is clear that when u <t, (5.12) cannot hold and when u> t, (5.11) cannot 
hold. When u = t, (5.11) and (5.12) both reduce to s m = v — t, s m+ i = t. 
This proves Lemma 5.1 for even m. The proof for the case of odd m, m > 3, 
is similar. □ 



Proof of Lemma 3.5. To prove (i), note that since \m — n$/| < 1, it 
follows from (3.6) that it^f minimizes s(tt) and since hi = for i = 1, . . . , v , 
it follows from (3.7) that ttj. f minimizes T(tt). Hence, tt ff maximizes F(tt). 
The proof of (ii) is similar. 

To prove (iii), first consider an order tt € O u . From (3.13), it follows that 



T(tt) > - 



1 



Case 1: Tn is even, in ^ 2. It follows from Lemma 5.1 that when tt £ O u ^ 
for u<t, 



s(tt) > 



1 



u + t 



m 2 +u(m + l) 2 + ^— -(m + 2) 2 -mv-t 



s mm + 



t — u 
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with s min as defined in (4.2). Similarly, for u>t, s(ir) > s min + (u - t)/2. 
Hence, from (3.3), we get, for tt € O u , 



smin + l± Y^> ~ T 



If we denote the upper bound in (5.13) by F*(u), then F*(u) < F*(t) for 
it > t. This implies that Maxo< u <„ F*{u) is attained at some u<t. When 
u <t, 



when A - Ai(/> 2 Q^ < 0, 
when Ao-Ai(/> 2 Q^ > 0. 

The lemma follows since F{%% F ) = F*(0) and F(ir NTF ) = F*(t). 

Case 2: m is odd, m > 3. The proof is similar. It can be shown that when 




F**(u), say. 



2^2 

Maxo< u < t , F**(u) is attained at some u <v — t. When u<v — t, 

|V*(0), when Ao-Ai^fJ) < 0, 

F"(n)< W 

when A o -Ai0 2 f -J > 0. 

Finally, in this case, F(ir% F ) = F**(0) and F(tt ntf ) = F**(v -t). □ 
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